Abstract

Gossip algorithms for distributed computation are attractive due to their simplicity, distributed nature, and robustness in noisy
and uncertain environments. However, using standard gossip algorithms can lead to a significant waste in energy by repeatedly
recirculating redundant information. For realistic sensor network model topologies like grids and random geometric graphs, the
inefficiency of gossip schemes is related to the slow mixing times of random walks on the communication graph. We propose and
analyze an alternative gossiping scheme that exploits geographic information. By utilizing geographic routing combined with a
simple resampling method, we demonstrate substantial gains over previously proposed gossip protocols. For regular graphs such
as the ring or grid, our algorithm improves standard gossip by factors of $n$ and $\sqrt{n}$ respectively. For the more challenging case
of random geometric graphs, our algorithm computes the true average to accuracy $\epsilon$ using
$O\left(\frac{n^{1.5}}{\sqrt{\log n}} \log \epsilon^{-1}\right)$ radio transmissions, which yields a $\sqrt{\frac{n}{\log n}}$ factor
improvement over standard gossip algorithms. We illustrate these theoretical results with experimental
comparisons between our algorithm and standard methods as applied to various classes of random fields.

Index Terms 

Gossip algorithms; sensor networks; message-passing algorithms; aggregation problems; consensus problems; distributed signal
processing; random geometric graphs.

I. Introduction 

Consider a network of n sensors, in which each node collects a measurement in some modality of interest (e.g., temperature,
light, humidity). In such a setting, it is frequently of interest to solve the distributed averaging problem: namely, to develop
a distributed algorithm by which all nodes can compute the average of the n sensor measurements. This problem and its
connection to Markov chain mixing rates have been studied for over thirty years [10], [11]. It has been the focus of renewed
interest over the past several years, motivated by various applications in sensor networks and distributed control systems. Early
work [10] studied deterministic protocols, known as consensus algorithms, in which each node communicates with each of its
neighbors in every round. More recent work (e.g., [12], [13]) has focused on so-called gossip algorithms, a class of randomized
algorithms that solve the averaging problem by computing a sequence of pairwise averages. In each round, one node is chosen
randomly, and it chooses one of its neighbors randomly. Both nodes compute the average of their values and replace their own
value with this average. By iterating this pairwise averaging process, the estimates of all nodes converge to the global average
under suitable conditions on the graph topology.
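
As a rough illustration of the pairwise averaging just described, the following minimal Python sketch simulates standard nearest-neighbor gossip. The function names, the ring topology helper, and the fixed random seed are assumptions made for illustration and do not come from the paper.

```python
import random


def ring_neighbors(n):
    """One-hop neighbor lists for a single cycle of n nodes (illustrative topology)."""
    return [[(i - 1) % n, (i + 1) % n] for i in range(n)]


def standard_gossip_round(x, neighbors, rng):
    """One round: a random node averages its value with a random one-hop neighbor."""
    s = rng.randrange(len(x))
    t = rng.choice(neighbors[s])
    x[s] = x[t] = (x[s] + x[t]) / 2.0


def run_standard_gossip(x0, neighbors, rounds, seed=0):
    """Run a fixed number of gossip rounds and return the final estimates."""
    rng = random.Random(seed)
    x = list(x0)
    for _ in range(rounds):
        standard_gossip_round(x, neighbors, rng)
    return x
```

Each round leaves the sum of the estimates unchanged, which is why the common limit, when the iteration converges, is the average of the initial values.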

The averaging problem is an archetypal instance of distributed signal processing, in which the goal is to achieve a global 
objective (e.g., computing the global average of all observations) based on purely local computations (in this case,
message-passing between pairs of adjacent nodes). Although distributed averaging itself is a very specialized problem, effective
averaging algorithms provide a useful building block for solving more complex problems in distributed signal processing.
Indeed, any averaging algorithm can be easily converted into a general algorithm that computes any linear projection of 
the sensor measurements, assuming that each sensor knows the corresponding coefficient of the projection vector. Recently, 
such algorithms have been proposed for various problems of distributed computation in sensor networks, including distributed 
filtering, detection, optimization, and compression [2], [3], [14], [15]. 

A fundamental issue — and the primary focus of this paper — is how many iterations it takes for any gossip algorithm to 
converge to a sufficiently accurate estimate. These convergence rates have received significant attention in recent work [6], 
[8], [12], [13], [16]-[19]. The convergence speed of a nearest-neighbor gossip algorithm, known as the averaging time, turns
out to be closely linked to the spectral gap (and hence the mixing time) of a Markov matrix defined by a weighted random
walk on the graph. Boyd et al. [16] showed how to optimize the neighbor selection probabilities for each node so as to find
the fastest-mixing Markov chain on the graph. For certain types of graphs, including complete graphs, expander graphs and 
peer-to-peer networks, such Markov chains are rapidly mixing, so that gossip algorithms converge very quickly. 

Unfortunately, for the graphs corresponding to typical wireless sensor networks, even an optimized gossip algorithm can 
result in very high energy consumption. For example, a common model for a wireless sensor network is a random geometric 
graph [20], in which all nodes are placed uniformly at random in an area and can communicate with neighbors within some
fixed radius $r > 0$. With the transmission radius scaling in the standard way [20] as
$r(n) = \Theta\left(\sqrt{\frac{\log n}{n}}\right)$, even an optimized
gossip algorithm requires $\Theta(n^2)$ transmissions (see Section II-D), which is of the same order as the energy required for every
node to flood its value to all other nodes. This problem is noted by Boyd et al. [16]: "In a wireless sensor network, Theorem 6 
suggests that for a small radius of transmission, even the fastest averaging algorithm converges slowly", and this limitation is 
intrinsic to standard gossip algorithms applied to such graphs. Intuitively, the nodes in a standard gossip protocol are essentially 
"blind," and they repeatedly compute pairwise averages with their one-hop neighbors. Information diffuses slowly throughout 
the network (roughly moving distance $\sqrt{k}$ in $k$ iterations), as in a random walk.

Accordingly, the goal of this paper is to develop and analyze alternative — and ultimately more efficient — methods for solving 
distributed averaging problems in wireless networks. We leverage the fact that sensor nodes typically know their locations, 
and can exploit this knowledge to perform geographic routing. Localization is itself a well-studied problem (e.g., [5], [9]), 
since geographic knowledge is required in numerous applications. With this perspective in mind, we propose an algorithm that, 
like a standard gossiping protocol, is randomized and distributed, but requires substantially less communication by exploiting 
geographic information. The idea is that instead of exchanging information with one-hop neighbors, geographic routing can be 
used to gossip with random nodes who are far away in the network. The bulk of our technical analysis is devoted to showing 
that the resulting rapid diffusion of information more than compensates for the extra cost of this multi-hop routing procedure. 

In effect, routing to far away neighbors creates an overlay communication network that is the complete graph, where an 
edge is assigned a cost equal to the number of hops on the route between the two nodes. For graphs with regular topology, 
it is relatively straightforward to see how this additional cost is offset by the benefit of faster convergence time. Indeed, two 
such examples, the cycle and the grid, are analyzed in Section III, where we show gains of the order $n$ and $\sqrt{n}$ respectively.
The more surprising result of this paper is that, by using a simple resampling technique, this type of benefit extends to random 
geometric graphs — a class of networks with irregular topology that are commonly used as a model of sensor networks formed 
by random deployments. 

The remainder of this paper is organized as follows. In Section II, we provide a precise statement of the distributed averaging
problem, describe our algorithm, state our main results on its performance, and compare them to previous results in the literature.
In Section III, we analyze the performance of our algorithms on two simple regular network topologies, the cycle and the grid.
Section IV provides the proofs of our result for the random geometric graph model. In Section V, we provide a number of
experimental results that illustrate and complement our theoretical analysis.

II. Problem formulation and main results 

In this section, we first formulate the distributed averaging problem in sensor networks and then describe our algorithm and 
main analytical results. We conclude with an overview and comparison to related work. 

A. Problem statement 

We begin by formulating the problem of distributed averaging and specifying the technical details of our time and
communication models.

1) Distributed averaging: Consider a graph $G$ with vertex set $V = \{1, \ldots, n\}$ and edge set $E \subset V \times V$. Suppose that at
time $k = 0$, each node $s \in V$ is given a real-valued number $x_s(0) \in \mathbb{R}$, representing an observation of some type. The goal
of distributed averaging is to compute the average $x_{\mathrm{ave}} := \frac{1}{n} \sum_{s=1}^{n} x_s(0)$ at all nodes of the graph. Consensus and gossip
algorithms achieve this goal as follows: at each time slot $k = 0, 1, 2, \ldots$, each node $s = 1, \ldots, n$ maintains an estimate $x_s(k)$
of the global average. We use $x(k)$ to denote the $n$-vector of these estimates; note that the estimates at different nodes
need not agree (i.e., $x_s(k)$ is in general different from $x_t(k)$ for $s \neq t$). The ultimate goal is to drive the estimate $x(k)$ to the
vector of averages $x_{\mathrm{ave}} \vec{1}$, where $\vec{1}$ is an $n$-vector of ones.

For the algorithms of interest to us, the quantity $x(k)$ for $k > 0$ is a random vector, since the algorithms are randomized in
their behavior. Accordingly, we measure the convergence of $x(k)$ to $x_{\mathrm{ave}} \vec{1}$ in the following sense [12], [16]:

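Definition 1: Given $\epsilon > 0$, the $\epsilon$-averaging time is the earliest time at which the vector $x(k)$ is $\epsilon$ close to the normalized
true average with probability greater than $1 - \epsilon$:

$$T_{\mathrm{ave}}(n, \epsilon) = \sup_{x(0)} \inf \left\{ k : P\left( \frac{\| x(k) - x_{\mathrm{ave}} \vec{1} \|}{\| x(0) \|} \geq \epsilon \right) \leq \epsilon \right\}, \qquad (1)$$

where $\| \cdot \|$ denotes the $\ell_2$ norm. Note that this is essentially measuring a rate of convergence in probability.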
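
To make the error measure behind Definition 1 concrete, here is a small Python sketch. It assumes that "normalized" means dividing the $\ell_2$ distance to the vector of averages by the norm of the initial vector $x(0)$, as in [16]; the helper names are hypothetical, and the second function only estimates the failure probability empirically from a set of simulated runs.

```python
import math


def normalized_error(x, x0):
    """Return ||x - x_ave * 1|| / ||x0||, where x_ave is the mean of x0.

    Assumes x0 is not the all-zero vector.
    """
    x_ave = sum(x0) / len(x0)
    num = math.sqrt(sum((xi - x_ave) ** 2 for xi in x))
    den = math.sqrt(sum(v * v for v in x0))
    return num / den


def fraction_not_converged(final_states, x0, eps):
    """Empirical estimate of P(normalized error >= eps) over independent runs."""
    bad = sum(1 for x in final_states if normalized_error(x, x0) >= eps)
    return bad / len(final_states)
```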

2) Asynchronous time model: We use the asynchronous time model [16], which is well-matched to the distributed nature of
sensor networks. In particular, we assume that each sensor has an independent clock whose "ticks" are distributed as a rate $\lambda$
Poisson process. The inter-tick times are exponentially distributed, independent across nodes, and independent across time. We
note that this model can be equivalently formulated in terms of a single global clock ticking according to a rate $n\lambda$ Poisson
process. Letting $\{Z_k\}$ denote the arrival times for this global clock, the individual clocks can be generated from the global
clock by randomly assigning each $Z_k$ to the sensors according to a uniform distribution. On average, there are approximately
$n$ global clock ticks per unit of absolute time (an exact analysis can be found in [16]). However, our analysis is based on
measuring time in terms of the number of ticks of this (virtual) global clock. Time is discretized, and the interval $[Z_k, Z_{k+1})$
corresponds to the $k$th timeslot. We can adjust time units relative to the communication time so that only one packet exists
in the network in each time slot with high probability. Note that this assumption is made only for analytical convenience; 
in a practical implementation, several packets might co-exist in the network, but the associated congestion control issues are 
beyond the scope of this work. 
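
The equivalence between per-node clocks and a single global clock can be sketched as follows. This is an illustrative simulation only: the function name and parameters are assumptions, and it simply draws exponential inter-arrival times at rate $n\lambda$ and assigns each tick to a uniformly chosen node.

```python
import random


def global_clock_ticks(n, rate, num_ticks, seed=0):
    """Simulate the global clock: returns a list of (arrival_time, node) pairs.

    Inter-arrival times are exponential with rate n * rate, and every tick
    is assigned to one of the n nodes uniformly at random.
    """
    rng = random.Random(seed)
    t = 0.0
    ticks = []
    for _ in range(num_ticks):
        t += rng.expovariate(n * rate)
        ticks.append((t, rng.randrange(n)))
    return ticks
```

With rate 1, the elapsed absolute time after `num_ticks` ticks concentrates around `num_ticks / n`, matching the statement that there are roughly n global ticks per unit of absolute time.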

3) Communication cost: We compare algorithms in terms of the amount of communication required. We will assume a 
fixed communication radius and hence the number of one-hop radio transmissions is proportional to the total energy spent for 
communication. More specifically, let $R(k)$ represent the number of one-hop radio transmissions required for a given node to
communicate with some other node in the interval $[Z_k, Z_{k+1})$. In a standard gossip protocol, the quantity $R(k) = R$ is simply
a constant, whereas for our protocol, $R(k)$ will be a random variable (with identical distribution for each time slot). The total
communication cost, measured in one-hop transmissions, is given by the random variable

$$C(n, \epsilon) = \sum_{k=1}^{T_{\mathrm{ave}}(n, \epsilon)} R(k). \qquad (2)$$

In this paper, we analyze mainly the expected communication cost, denoted by $\mathcal{E}(n, \epsilon)$, which is given by

$$\mathcal{E}(n, \epsilon) = E[R(k)] \, T_{\mathrm{ave}}(n, \epsilon). \qquad (3)$$

Our analysis also yields probabilistic upper bounds on the communication cost $C(n, \epsilon)$ of the form

$$P\{ C(n, \epsilon) > f(n, \epsilon) \} < \frac{\epsilon}{2}. \qquad (4)$$

4) Graph topologies: This paper treats both standard graphs with regular topology, including the single cycle graph and
regular grid as illustrated in panels (a) and (b) respectively of Figure 1, and an important subclass of random graphs with
irregular topologies, namely those formed by random geometric graphs [20]. The random graph model has been used in previous
work on wireless sensor networks [16], [21]. More precisely, the random geometric graph $G(n, r)$ is formed by choosing $n$
sensor locations uniformly and independently in the unit square, with any pair of nodes $s$ and $t$ connected if and only if
their Euclidean distance is smaller than some transmission radius $r$. A sample from this random graph model is illustrated
in Figure 1(c). It is well known [20]-[22] that in order to maintain connectivity and minimize interference, the transmission
radius $r(n)$ should scale like $\Theta\left(\sqrt{\frac{\log n}{n}}\right)$. For the purposes of analysis, we assume that communication within this transmission
radius always succeeds.¹
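
A sample from the random geometric graph model can be generated with a few lines of Python. This is a sketch under stated assumptions: the constant `c` in the radius $r(n) = \sqrt{c \log n / n}$ is a free illustrative parameter (the text only fixes the scaling), and the quadratic-time neighbor search is chosen for clarity rather than efficiency.

```python
import math
import random


def random_geometric_graph(n, c=2.0, seed=0):
    """Sample G(n, r) in the unit square with r = sqrt(c * log(n) / n).

    Returns (points, neighbors, r); neighbors[i] lists the nodes within
    Euclidean distance r of node i.
    """
    rng = random.Random(seed)
    r = math.sqrt(c * math.log(n) / n)
    points = [(rng.random(), rng.random()) for _ in range(n)]
    neighbors = [[] for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            dx = points[i][0] - points[j][0]
            dy = points[i][1] - points[j][1]
            if math.hypot(dx, dy) < r:
                neighbors[i].append(j)
                neighbors[j].append(i)
    return points, neighbors, r
```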

B. Proposed Algorithm 

The proposed algorithm combines gossip with geographic routing. The key assumption is that each node $s$ knows its own
geographic location within some compact subset $\mathcal{C} \subset \mathbb{R}^2$, specified as a Euclidean pair $(x_s, y_s) \in \mathcal{C}$. For the regular grid and
random geometric graphs, we take $\mathcal{C}$ to be the unit square $[0, 1] \times [0, 1]$, whereas for the single cycle graph we take $\mathcal{C}$ to be
the unit circle $S^1$. In addition, each node can learn the geographic locations of its one-hop neighbors (i.e., vertices $t \in V$ such
that $(s, t) \in E$) using a single transmission per node.

Geographic Gossip Algorithm: Suppose the $j$-th clock tick $Z_j$ is assigned to node $s$ at location $\ell(s)$. The following events
then happen:

(1) Node $s$ activates and chooses a point $y = (y_1, y_2)$ uniformly in the region $\mathcal{C}$, referred to as the target location. Node $s$
forms the tuple $m_s = (x_s(j), \ell(s), y)$.

(2) Node $s$ sends $m_s$ to its one-hop neighbor $t \in N(s)$ closest to location $y$. This operation continues in a recursive manner:
when a successive node $r$ receives a packet $m_s$, it relays the packet $m_s$ to its one-hop neighbor closest to location
$y$. Greedy geographic routing terminates when a node receives the packet and has no one-hop neighbors
closer to the random target than itself. Let $v$ be the node closest to location $y$.

(3) Node $v$ makes an independent randomized decision to accept $m_s$. If the packet is accepted, $v$ computes its new value
$x_v(j + 1) = \frac{1}{2}(x_v(j) + x_s(j))$ and generates a message $m_v = (x_v(j), \ell(v), \ell(s))$, which is sent back to $s$ via greedy
geographic routing. Node $s$ can then compute its new value $x_s(j + 1) = \frac{1}{2}(x_v(j) + x_s(j))$, and the round ends. If the
packet is rejected, then $v$ sends a rejection message to $s$.

(4) If $v$ rejects the packet from $s$, then $s$ chooses a new point $y'$ uniformly in the region $\mathcal{C}$ and repeats steps (2)-(4) with
message $m'_s = (x_s(j), \ell(s), y')$.

Fig. 1. Illustration of various graphs (nodes as circles and edges as solid lines) and the Voronoi regions associated with each node (dotted lines):
(a) cycle graph, (b) regular grid, (c) random geometric graph.

¹ However, we note that our proposed algorithm remains robust to communication and node failures.


At a high level, the motivation of the geographic gossip algorithm is to exploit geographic information (via the greedy 
routing protocol described in step (2)) to create a new communication graph G' = (V,E') as an overlay of the original graph 
$G = (V, E)$. Note that the new communication graph $G'$ has the same vertex set, but an expanded edge set (i.e., $E' \supseteq E$). In
fact, for all of the versions of geographic gossip analyzed in this paper, the extended communication graph $G'$ is the complete
graph, meaning that $(s, t) \in E'$ for all $s \neq t$. In the standard gossip protocol, each gossip round takes two radio transmissions.
In the new communication graph G', certain edges are more costly in terms of one-hop radio transmissions because of the 
routing required to carry out the communication. On the other hand, the benefit is that the new communication graph G' is 
dense, so that gossiping converges more quickly. Our main result shows that this tradeoff — between the cost of each gossip 
round and the total number of rounds — can lead to favorable reductions in the total number of one-hop radio transmissions. 
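
The steps of the geographic gossip algorithm can be sketched in Python as follows. This is a simplified illustration, not the authors' implementation: the names are hypothetical, the optional `accept_prob` hook stands in for the randomized decision of step (3) (omitting it means every query is accepted, and any supplied hook must return a positive probability), the cost of the reply or rejection message is assumed to equal the forward hop count, and the grid helper is included only so the example is self-contained.

```python
import math
import random


def grid_graph(k):
    """k x k grid in the unit square with 4-connected neighbors (illustrative)."""
    points = [((i + 0.5) / k, (j + 0.5) / k) for i in range(k) for j in range(k)]
    neighbors = []
    for i in range(k):
        for j in range(k):
            nb = []
            if i > 0:
                nb.append((i - 1) * k + j)
            if i < k - 1:
                nb.append((i + 1) * k + j)
            if j > 0:
                nb.append(i * k + j - 1)
            if j < k - 1:
                nb.append(i * k + j + 1)
            neighbors.append(nb)
    return points, neighbors


def greedy_route(points, neighbors, src, target):
    """Step (2): forward to the neighbor closest to the target until no
    neighbor is strictly closer than the current node. Returns (node, hops)."""
    def dist(i):
        return math.hypot(points[i][0] - target[0], points[i][1] - target[1])

    cur, hops = src, 0
    while True:
        best = min(neighbors[cur], key=dist, default=cur)
        if dist(best) >= dist(cur):
            return cur, hops
        cur, hops = best, hops + 1


def geographic_gossip_round(x, points, neighbors, rng, accept_prob=None):
    """One gossip round; returns the number of one-hop transmissions used."""
    s = rng.randrange(len(x))
    transmissions = 0
    while True:
        y = (rng.random(), rng.random())            # step (1): random target
        v, hops = greedy_route(points, neighbors, s, y)
        transmissions += 2 * hops                   # forward route plus reply
        p = 1.0 if accept_prob is None else accept_prob(v)
        if rng.random() < p:                        # step (3): accept
            x[s] = x[v] = (x[s] + x[v]) / 2.0
            return transmissions
        # step (4): rejected, so s draws a new target and tries again
```

For example, `geographic_gossip_round(x, *grid_graph(5), random.Random(0))` performs one round on a 25-node grid and reports its transmission count.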

C. Overview of main results 

The geographic gossip algorithm is a randomized procedure that induces a probability distribution over the sensor $v$ chosen
at each round. By construction, the probability of choosing sensor $v$ in step (2) of the geographic gossip algorithm is equal
to $a_v$, the area of its associated Voronoi region. For certain types of regular graphs, such as the single cycle and regular grid
shown in panels (a) and (b) of Figure 1, this distribution over Voronoi regions is uniform. In this particularly favorable setting,
the "randomized" decision of node $v$ in step (3) is simple: it accepts the packet $m_s$ with probability one. With this choice, the
distribution over chosen nodes $v$ is guaranteed to be uniform for these regular graphs. Consequently, it can be shown using
known results for mixing on the complete graph that the averaging time of geographic gossip $T_{\mathrm{ave}}(n, \epsilon)$ is $O(n \log \epsilon^{-1})$. The
communication cost is given by $\mathcal{E}(n, \epsilon) = E[R(k)] \, T_{\mathrm{ave}}(n, \epsilon)$, where $R(k) = R$ is the number of single-hop communications
required in round $k$ of the protocol. By computing the expected value $E[R]$, it can be shown that the overall communication
costs for these regular topologies scale as $\mathcal{E}(n, \epsilon) = \Theta(n^2 \log \epsilon^{-1})$ for the single cycle, and $\mathcal{E}(n, \epsilon) = \Theta(n^{1.5} \log \epsilon^{-1})$ for the
regular grid. Thus, as derived in Section III, geographic gossip yields improvements by factors of $n$ and $\sqrt{n}$ over standard
gossip for these regular graphs.

For random geometric graphs, in contrast, the distribution of Voronoi regions is quite non-uniform. Consequently, in order
to bound the averaging time $T_{\mathrm{ave}}(n, \epsilon)$, we use in step (3) a rejection sampling scheme previously proposed by Bash et al. [23]
in order to "temper" the distribution. Given the $n$-vector $a$ of areas of the sensors' Voronoi regions, we set a threshold $\tau$.
Sensors with cell area smaller than $\tau$ always accept a query, and sensors with cell areas larger than $\tau$ may reject the query
with a certain probability. The rejection sampling method simultaneously protects against oversampling and limits the number
of undersampled sensors, which allows us to prove that $T_{\mathrm{ave}}(n, \epsilon) = O(n \log \epsilon^{-1})$ even for this perturbed distribution.

Of course, nothing comes for free: the rejection sampling scheme requires a random number $Q$ of queries before a sensor
accepts. Since the queries are independent, $Q$ is a geometric random variable with parameter equal to the probability of a
query being accepted. In terms of the number of queries, the total number of radio transmissions for the $k$th gossip round is
$R(k) = O(Q \cdot G)$, where $G$ denotes the number of one-hop transmissions needed to route a packet to its target. Therefore if
$T_{\mathrm{ave}}$ gossip rounds take place overall, the expected number of radio transmissions will be $\mathcal{E}(n, \epsilon) =
E[Q \cdot G \cdot T_{\mathrm{ave}}(n, \epsilon)]$. Accordingly, a third key component of our analysis in Section IV is to show that the probability of
acceptance remains larger than a constant, which allows us to upper bound the expectation of the geometric random variable
$Q$ by a constant. We also establish an upper bound on the maximum value of $Q$ over $T_{\mathrm{ave}}$ rounds that holds with probability
greater than $1 - \epsilon/2$.
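
The effect of thresholded rejection sampling can be illustrated with a short Python sketch. The acceptance rule used here, accepting with probability $\min(1, \tau / a_v)$, is an assumption chosen because it matches the qualitative description (small cells always accept, large cells sometimes reject); the exact rule is the one specified in [23] and in the later analysis. All names are hypothetical.

```python
def acceptance_probability(area, tau):
    """Assumed rule: cells with area <= tau always accept, larger cells
    accept with probability tau / area."""
    return min(1.0, tau / area)


def tempered_distribution(areas, tau):
    """Distribution over accepted sensors and the overall acceptance probability.

    A query lands in cell v with probability areas[v] (areas sum to 1), so
    sensor v is selected and accepts with probability min(areas[v], tau).
    """
    weights = [a * acceptance_probability(a, tau) for a in areas]
    p_accept = sum(weights)
    return [w / p_accept for w in weights], p_accept


def expected_queries(p_accept):
    """Mean of a geometric random variable Q with success probability p_accept."""
    return 1.0 / p_accept
```

With areas `[0.5, 0.3, 0.1, 0.1]` and `tau = 0.2`, the two large cells are capped at weight 0.2 each, the overall acceptance probability is 0.6, and on average 1/0.6 queries are needed per round.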

Putting together the pieces yields our main result for random geometric graphs: the expected cost for computing the average 
with the proposed geographic gossip algorithm is 

$$\mathcal{E}(n, \epsilon) = O\left( \frac{n^{1.5}}{\sqrt{\log n}} \log \epsilon^{-1} \right). \qquad (5)$$

In comparison to previous results on standard gossip for random graphs [16], geographic gossip yields a reduction by a factor
of $\sqrt{\frac{n}{\log n}}$ in the number of one-hop communication rounds.

We note that for some classes of graphs, the rejection sampling may not be necessary, even when the induced distribution is
not uniform, as long as it is reasonably close to uniform. In particular, if we have a lower bound on the area of a
Voronoi cell for all sensors, then sampling by area is approximately uniform. If we can obtain a slightly looser bound on the
deviations of the Voronoi areas, alternative techniques may be able to show that our algorithm will not suffer a performance 
loss without rejection sampling. However, for geometric random graphs, it is difficult to obtain a good lower bound on the 
Voronoi cell size, which is our motivation for applying and analyzing the rejection sampling scheme. 

D. Related work and comparisons 

Boyd et al. [13], [16] have analyzed the performance of standard gossip algorithms. Their fastest standard gossip algorithm
for the ensemble of random geometric graphs $G(n, r)$ has an $\epsilon$-averaging time [16] $T_{\mathrm{ave}}(n, \epsilon) = \Theta\left( n \frac{\log \epsilon^{-1}}{r(n)^2} \right)$. (This quantity
is computed in Section IV-A of Boyd et al. [16], but the result is expressed in terms of absolute time units, which need
to be multiplied by $n$ to become clock ticks.) Consequently, for the standard choice of radius $r(n) = \Theta\left(\sqrt{\frac{\log n}{n}}\right)$ ensuring
network connectivity, this averaging time scales as $\Theta\left( \frac{n^2}{\log n} \log \epsilon^{-1} \right)$. In standard gossip, each gossip round corresponds to
communication with only a one-hop neighbor and hence costs only one radio transmission, which means that the fastest standard
gossip algorithm will have a total cost $\mathcal{E}(n, \epsilon) = \Theta\left( \frac{n^2}{\log n} \log \epsilon^{-1} \right)$ radio transmissions. Therefore, our proposed algorithm saves
a factor of $\sqrt{\frac{n}{\log n}}$ in communication energy by exploiting geographic information.
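
The claimed saving can be checked by dividing the two cost expressions, with constants ignored. The subscripts "std" and "geo" below are labels introduced here for illustration only:

```latex
\frac{\mathcal{E}_{\mathrm{std}}(n,\epsilon)}{\mathcal{E}_{\mathrm{geo}}(n,\epsilon)}
  \;\sim\;
  \frac{\dfrac{n^{2}}{\log n}\,\log \epsilon^{-1}}
       {\dfrac{n^{1.5}}{\sqrt{\log n}}\,\log \epsilon^{-1}}
  \;=\; \frac{n^{0.5}}{\sqrt{\log n}}
  \;=\; \sqrt{\frac{n}{\log n}} .
```

As a rough numerical instance, taking $n = 10^4$ and natural logarithms gives $\sqrt{10^4 / 9.21} \approx 33$, so under these scalings the saving would be on the order of a factor of thirty, up to the hidden constants.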

A number of recent papers [6], [18], [19] have also considered the problem of computing averages in networks. The consensus 
propagation algorithm of Moallemi and van Roy [18] is a modified form of belief propagation that attempts to mitigate the 
inefficiencies introduced by the "random walk" in gossip algorithms. For the single cycle graph, they show improvement 
by a factor of $\Theta\left(\frac{n}{\log n}\right)$ over standard gossip. Our results for geographic gossip on the single cycle (see Section III) show
improvement by a factor of $O(n)$ over standard gossip, and hence a factor $O(\log n)$ over consensus propagation. It is not yet
known how consensus propagation would behave for the random geometric graphs also considered in this paper. Mosk-Aoyama 
and Shah [6] use an algorithm based on Flajolet and Martin [24] to compute averages, and bound the averaging time in terms 
of a "spreading time" associated with the communication graph. However, they only show the optimality of their algorithm 
for a graph consisting of a single cycle, so it is currently difficult to speculate how it would perform on other regular graphs 
or geometric random graphs. Alanyali et al. [19] consider the related problem of computing the average of a network at a 
single node (in contrast to computing the average in parallel at every node). They propose a distributed algorithm to solve this 
problem and show how it can be related to cover times of random walks on graphs. 

III. Analysis for Regular Networks 

In this section, we illustrate the benefits of our geographic gossip algorithm for two simple networks, the ring and the grid, 
both of which are regular graphs. Due to this regularity, the implementation and analysis of geographic gossip turns out to 
be especially simple. More specifically, when these graphs are viewed as contained within the unit disk (ring graph) or the unit
square (grid graph), then the Voronoi region of each node is equal in area (see Figure 1). Consequently, sampling a location
uniformly in the space is equivalent to sampling a sensor uniformly, and thus the overlay graph created by geographic routing
(step (2) of the geographic gossip algorithm) is a complete graph with uniform edge weights. In this case, the randomized
decision rule in step (3) is not needed: the target $v$ always accepts the message. For the ring, we show that standard gossip
has a communication cost $\mathcal{E}(n, \epsilon)$ for $\epsilon$-accuracy that scales as $\Theta(n^3 \log \epsilon^{-1})$, and that geographic gossip can improve this to
$O(n^2 \log \epsilon^{-1})$. For the grid, we show that standard gossip has communication cost $\Theta(n^2 \log \epsilon^{-1})$, and geographic gossip can
improve this to $O(n^{3/2} \log \epsilon^{-1})$.

A. Analysis of single cycle graph 

The ring network consists of a single cycle of $n$ nodes equispaced on the unit circle (see Figure 1(a)). For this simple
network, we have the following result characterizing the improvement of geographic gossip over standard gossip:

Proposition 1: In terms of the communication cost $\mathcal{E}(n, \epsilon)$ for $\epsilon$-accuracy, geographic gossip yields an $\Omega(n)$ improvement
over standard gossip on the single cycle graph.

Proof: We first compute the communication cost $\mathcal{E}(n, \epsilon)$ for standard gossip. In standard nearest-neighbor gossip, the
probability $p_{ij}$ that node $i$ chooses to average with node $j$ is $0$ unless $|i - j| = 1$ (with indices taken modulo $n$), in which case
it is $1/2$. Therefore the matrix $P = (p_{ij})$ is a symmetric circulant matrix, generated by the $n$-vector $(0, 1/2, 0, 0, \ldots, 1/2)$. Using
previous results on standard gossip [16], in order to evaluate the performance of standard gossip, we must find the second
eigenvalue $\lambda_2$ of the matrix $W$ defined by

$$W = I - \frac{1}{2n} D + \frac{P + P^T}{2n},$$

where $D$ is the diagonal matrix with entries $D_i = \sum_{j=1}^{n} (p_{ij} + p_{ji})$.

Note that $W$ is also a circulant matrix, generated by the $n$-vector $(1 - n^{-1}, (2n)^{-1}, 0, 0, \ldots, (2n)^{-1})$. Circulant matrices are
diagonalized by the discrete Fourier transform (DFT) matrix, so that the eigenvalues can be computed explicitly as

$$\lambda_k(W) = \left( 1 - \frac{1}{n} \right) + \frac{1}{n} \cos\left( \frac{2\pi k}{n} \right), \qquad k = 0, 1, \ldots, n - 1.$$

Consequently, the second largest eigenvalue is given by

$$\lambda_2(W) = 1 - \frac{1}{n} + \frac{1}{n} \sum_{j=0}^{\infty} \frac{(-1)^j}{(2j)!} \left( \frac{2\pi}{n} \right)^{2j} = 1 - \Theta(n^{-3}).$$



Therefore, by a Taylor series expansion, we have $\log \lambda_2(W) = -\Theta(n^{-3})$. Applying previous results [16] on standard gossip,
we conclude that the $\epsilon$-averaging time of standard gossip is:

$$T_{\mathrm{ave}}(n, \epsilon) = \Theta(n^3 \log \epsilon^{-1}).$$

Since each gossip communication costs us one hop, the average number of one-hop transmissions for standard gossip on the
ring is

$$\mathcal{E}(n, \epsilon) = \Theta(n^3 \log \epsilon^{-1}). \qquad (6)$$
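
The circulant eigenvalue computation for the ring can be checked numerically. The following Python sketch builds the matrix $W$ with generator $(1 - 1/n, 1/(2n), 0, \ldots, 0, 1/(2n))$ and verifies that the cosine vectors are eigenvectors with eigenvalue $1 - 1/n + \frac{1}{n}\cos(2\pi k/n)$. Function names are illustrative, and the code assumes $n \geq 3$.

```python
import math


def ring_gossip_matrix(n):
    """Circulant W for standard gossip on the ring (assumes n >= 3)."""
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        W[i][i] = 1.0 - 1.0 / n
        W[i][(i + 1) % n] += 1.0 / (2 * n)
        W[i][(i - 1) % n] += 1.0 / (2 * n)
    return W


def ring_eigenvalue(n, k):
    """Closed-form eigenvalue 1 - 1/n + cos(2*pi*k/n)/n."""
    return 1.0 - 1.0 / n + math.cos(2.0 * math.pi * k / n) / n


def eigen_residual(W, k):
    """Max entry of |W v - lambda_k v| for v_j = cos(2*pi*k*j/n)."""
    n = len(W)
    v = [math.cos(2.0 * math.pi * k * j / n) for j in range(n)]
    lam = ring_eigenvalue(n, k)
    Wv = [sum(W[i][j] * v[j] for j in range(n)) for i in range(n)]
    return max(abs(Wv[i] - lam * v[i]) for i in range(n))


def spectral_gap(n):
    """1 - lambda_2(W) = (1 - cos(2*pi/n)) / n, which behaves like 2*pi^2 / n^3."""
    return (1.0 - math.cos(2.0 * math.pi / n)) / n
```

Multiplying `spectral_gap(n)` by `n**3` approaches $2\pi^2$ as $n$ grows, which is the $\Theta(n^{-3})$ gap used in the proof.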

We now show how geographic gossip reduces the number of one-hop transmissions. In geographic gossip for the ring
network, a source node chooses a location on the unit circle uniformly at random, which induces a uniform
distribution over the nodes in the network (see Figure 1(a)). It then sends a packet to its target around the ring and they
exchange values. We think of geographic gossip as running a gossip algorithm on the complete graph with $p_{ij} = 1/n$ for all
$i$ and $j$. For this graph, we have

$$W = \left( 1 - \frac{1}{n} \right) I + \frac{1}{n^2} \vec{1} \vec{1}^T.$$

Calculating the second largest eigenvalue yields $\lambda_2(W) = 1 - \frac{1}{n} = 1 - \Theta(n^{-1})$, so $\log \lambda_2(W) = -\Theta(n^{-1})$ and hence
$T_{\mathrm{ave}}(n, \epsilon) = O(n \log \epsilon^{-1})$. By summing over the pairwise distances in the graph, we see that the expected number of one-hop
transmissions at any round is bounded by

$$E[R] = E[R(k)] \leq \frac{2}{n} \sum_{i=1}^{\lceil n/2 \rceil} (2i) = O(n).$$

Thus, the expected number of transmissions for geographic gossip is given by

$$\mathcal{E}(n, \epsilon) = T_{\mathrm{ave}}(n, \epsilon) \, E[R] = O(n^2 \log \epsilon^{-1}). \qquad (7)$$

Comparing equations (6) and (7) yields the claim. ■

As demonstrated by this result, for the ring network, using geographic knowledge and routing improves the energy consumption
as measured in hops by a factor of $n$. In standard gossip, information from one node diffuses slowly in a ring, taking almost
$n^2$ steps to become uniformly distributed. Geographic gossip allows the information from one node in the network to travel
larger distances at the expense of the routing cost.
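
Two ingredients of the ring analysis can be checked directly in Python: the action of the complete-graph matrix $W = (1 - 1/n) I + \frac{1}{n^2} \vec{1}\vec{1}^T$, and the average routing distance on the ring. The sketch below assumes the target node is uniform over all $n$ nodes (including the source itself) and that routing follows the shorter arc; the function names are hypothetical.

```python
def apply_complete_graph_W(v):
    """Apply W = (1 - 1/n) I + (1/n^2) * ones * ones^T to the vector v."""
    n = len(v)
    total = sum(v)
    return [(1.0 - 1.0 / n) * vi + total / (n * n) for vi in v]


def expected_ring_hops(n):
    """Average one-way hop count to a uniformly chosen target on an n-cycle,
    routing along the shorter arc."""
    return sum(min(j, n - j) for j in range(n)) / n
```

Any vector whose entries sum to zero is scaled by exactly $1 - 1/n$, while the all-ones vector is left unchanged. The average one-way distance grows linearly (about $n/4$), so each round costs $O(n)$ hops, consistent with the bound on $E[R]$.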

B. Analysis of regular grid

We now turn to geographic gossip on the two-dimensional grid defined by a collection of $n$ vertices $s_{ij}$ located at positions
$(i/\sqrt{n}, j/\sqrt{n})$ within the unit square $[0, 1] \times [0, 1]$, as illustrated in Figure 1(b).

Proposition 2: In terms of the communication cost $\mathcal{E}(n, \epsilon)$ required to achieve $\epsilon$-accuracy, geographic gossip yields an $\Omega(\sqrt{n})$
improvement over standard gossip on the regular 2-D grid.

Proof: The performance of standard gossip on the grid can be calculated using Corollary 1 from Boyd et al. [25], which
says that the averaging time is given by $T_{\mathrm{ave}}(n, \epsilon) = \Theta\left( \frac{n \log \epsilon^{-1}}{1 - \lambda_2(P)} \right)$. For standard gossip on the grid, the matrix $P$ is simply
the transition matrix of a random walk on the two-dimensional grid, for which it is known [26] that $(1 - \lambda_2(P))^{-1} = \Theta(n)$.
Consequently, we have $T_{\mathrm{ave}}(n, \epsilon) = \Theta(n^2 \log \epsilon^{-1})$, so that the average number of one-hop transmissions is

$$\mathcal{E}(n, \epsilon) = \Theta(n^2 \log \epsilon^{-1}). \qquad (8)$$

Now let us turn to geographic gossip. For a regular topology like the grid, the Voronoi cells are all of equal area, so
in step (3) of the geographic gossip algorithm, the chosen target $v$ simply accepts with probability one. Consequently,
the number of one-hop communications per round is simply the route length. For a regular 2-dimensional grid, routing the
message at round $k$ costs $E[R(k)] = O(\sqrt{n})$ one-hop transmissions. As we derived for the ring network, the geographic gossip
algorithm is communicating on an overlay network that is fully connected, so that the number of rounds required scales as
$T_{\mathrm{ave}}(n, \epsilon) = O(n \log \epsilon^{-1})$. Putting the pieces together, we conclude that the total communication cost for $\epsilon$-accuracy using
geographic gossip scales as

$$\mathcal{E}(n, \epsilon) = O(n^{1.5} \log \epsilon^{-1}). \qquad (9)$$

Comparing equations (8) and (9) yields the claim. ■
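
The $O(\sqrt{n})$ routing cost on the grid can be illustrated by computing the average route length exactly. This sketch assumes a $k \times k$ grid with $n = k^2$ nodes, a target node chosen uniformly (equal Voronoi areas), and a greedy route whose length equals the Manhattan distance between source and target on the 4-connected grid; the function name is hypothetical.

```python
def expected_grid_hops(k):
    """Average one-way Manhattan distance between two independent uniform
    nodes of a k x k grid (n = k * k nodes)."""
    per_axis = sum(abs(i - j) for i in range(k) for j in range(k)) / (k * k)
    return 2.0 * per_axis
```

The result agrees with the closed form $2(k^2 - 1)/(3k)$, which is approximately $\frac{2}{3}\sqrt{n}$, so the per-round cost grows like $\sqrt{n}$ while the number of rounds grows like $n \log \epsilon^{-1}$.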
Thus, for the regular grid in 2-dimensions, geographic gossip yields a factor of $\sqrt{n}$ savings in the convergence time. The ease
of our analysis in both of the preceding examples — ring and grid networks — arises from the regularity of the topology, which 
allowed us to either write the transition matrix explicitly or use standard results. The following section is devoted to analysis 
of geographic gossip for random geometric graphs, where we will derive a similar performance improvement. For random 
geometric graphs, in contrast to the regular topologies considered thus far, we will use a non-trivial randomized decision rule 
in step (3) of the gossip algorithm in order to compensate for irregularities of the graph topology and areas of Voronoi
regions. 



IV. Analysis for Random Geometric Graphs 

We now turn to an analysis of the number of one-hop communications needed for our algorithm in the case of the random 
geometric graph model. At a high level, our analysis consists of three main steps: 

1) First, we address the number of one-hop transmissions $G$ required to route a packet from node $s$ to the randomly chosen
target $v$ (see step (2) of the geographic gossip algorithm). We first prove that when the connectivity radius of the random
graphs scales in the standard way as $r(n) = \Theta\left(\sqrt{\frac{\log n}{n}}\right)$, greedy routing always reaches the closest node $v$ to the random
target with $G = O(1/r(n)) = O\left(\sqrt{\frac{n}{\log n}}\right)$
one-hop radio transmissions. Note that in practice more sophisticated geographic routing algorithms (e.g., [27]) can be 
used to ensure that the packet approaches the random target when there are "holes" in the node coverage. However, 
greedy geographic routing is adequate for the problem considered here. 

2) As discussed above, when geographic gossip is applied to a graph with an irregular topology (such as a random geometric 
graph), it is necessary to compensate for the irregularity with a non-trivial accept/reject protocol in step (3) of the 
algorithm. Accordingly, our next step is to bound the expected number of rejections experienced by a given sensor s. 

3) The final step is to analyze the number of such gossip rounds needed for the average to converge to within the target 
error. 

We take up each of these factors in turn in the subsections to follow. 



A. Routing in $O(1/r(n))$

We first address how to choose the transmission radius of the sensors in order to guarantee the network's connectivity and 
the success of greedy geographic routing. 

Lemma 1 (Network connectivity): Let a graph be drawn randomly from the geometric ensemble $G(n, r)$ defined in Section II-A, and let the unit area be partitioned into squares of side length $a(n) = \sqrt{2 \log n / n}$. Then the following statements
all hold with high probability:

(a) Each square contains at least one node.

(b) If $r(n) = \sqrt{10 \log n / n}$, then each node can communicate to a node in the four adjacent squares.

(c) All the nodes in each square are connected with each other.

Fig. 2. Ensuring network connectivity. Any node in the dark-shaded center square can communicate with its neighbors in the four adjacent lighter-shaded
squares if $r(n) = \sqrt{5}\, a(n)$.

Proof: The total number of squares of side length $a(n)$ is $M = \frac{n}{2 \log n}$. We view these as "bins" into which the $n$ sensors
are assigned uniformly. Standard results on this random process [22], [28] show that with high probability $\Theta(M \log M)$ sensors
are sufficient to cover all of the bins, proving (a).

Figure 2 shows a simple geometric argument for (b) and (c). For $r(n) = \sqrt{5}\, a(n)$, a sensor at any position in its square can
communicate with all sensors in the four squares adjacent to it. ■
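Part (a) of Lemma 1 can be checked empirically. The following is a minimal sketch, not part of the original analysis: the function name `all_squares_occupied` and the choice to round the number of squares per side up (so that each square has side at most $a(n)$) are assumptions made only for illustration.

```python
import math
import numpy as np

def all_squares_occupied(n, seed=0):
    """Drop n sensors uniformly in the unit square and test Lemma 1(a).

    Returns (occupied, k), where k is the number of squares per side.
    Hypothetical helper written for illustration only.
    """
    rng = np.random.default_rng(seed)
    a = math.sqrt(2.0 * math.log(n) / n)   # side length a(n) from Lemma 1
    k = math.ceil(1.0 / a)                 # actual side 1/k is at most a(n)
    pts = rng.random((n, 2))               # coordinates in [0, 1)
    # Row/column index of the square containing each sensor.
    idx = np.minimum((pts * k).astype(int), k - 1)
    occupied = np.zeros((k, k), dtype=bool)
    occupied[idx[:, 0], idx[:, 1]] = True
    return bool(occupied.all()), k
```

For example, `all_squares_occupied(2000)` partitions the unit square into a 12 × 12 grid; with roughly 14 sensors expected per square, an empty square is very unlikely.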

Lemma 2 (Greedy geographic routing): Suppose that a target location is chosen in the unit square. Then greedy
geographic routing routes to the node closest to the target in $O(1/r(n)) = O\left(\sqrt{n / \log n}\right)$ steps.

Proof: By Lemma 1(a), every square of side length $a(n) = \sqrt{2 \log n / n}$ is occupied by at least one node. Therefore, we can
perform greedy geographic routing by first matching the row and then the column of the square which contains the target, which
requires at most $2/a(n) = O\left(\sqrt{n / \log n}\right)$ hops. After reaching the square where the target is contained, Lemma 1(c) guarantees that
the subgraph contained in the square is completely connected. Therefore, one more hop suffices to reach the node closest to
the target. ■
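As an illustration of the routing step, here is a minimal sketch of greedy geographic forwarding on a random geometric graph. It is an assumption-laden example rather than the exact scheme in the proof: each node forwards to the one-hop neighbor closest to the target (instead of matching row and then column), and the names `greedy_route` and `demo_route` are hypothetical.

```python
import math
import numpy as np

def greedy_route(pts, r, source, target):
    """Forward a packet greedily toward the location `target`.

    At each hop the packet moves to the neighbor (within radius r) that is
    closest to the target, and stops when no neighbor is strictly closer.
    Returns (final_node, hops).
    """
    dist_to_target = np.linalg.norm(pts - target, axis=1)
    current, hops = int(source), 0
    while True:
        # One-hop neighborhood of the current node (includes the node itself).
        nbrs = np.flatnonzero(np.linalg.norm(pts - pts[current], axis=1) <= r)
        best = int(nbrs[np.argmin(dist_to_target[nbrs])])
        if dist_to_target[best] >= dist_to_target[current]:
            return current, hops
        current, hops = best, hops + 1

def demo_route(n, seed=0):
    """Route from node 0 to a random target using r(n) = sqrt(10 log n / n)."""
    rng = np.random.default_rng(seed)
    pts = rng.random((n, 2))
    r = math.sqrt(10.0 * math.log(n) / n)
    target = rng.random(2)
    final, hops = greedy_route(pts, r, source=0, target=target)
    nearest = int(np.argmin(np.linalg.norm(pts - target, axis=1)))
    return final, hops, nearest
```

Because the distance to the target strictly decreases at every hop, the loop always terminates; at this connectivity radius the packet is expected to stop at the node nearest the target.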
These routing results allow us to bound the cost in hops for an arbitrary pair of nodes in the network to exchange values. 
In the next section, we analyze a rejection sampling method used to reduce the nonuniformity of the distribution. 

B. Rejection sampling 

As mentioned in the previous section, sampling geographic locations uniformly induces a nonuniform sampling distribution 
on the sensors. Assigning locations to the nearest sensors induces a Voronoi tessellation of the plane, and sensor v is queried 
with probability proportional to the area $a_v$ of its Voronoi cell. By judiciously rejecting queries, the sensors with larger Voronoi
areas can ensure that they are not oversampled. We adopt the rejection sampling scheme proposed by Bash et al. [23]: when
queried, sensor $v$ accepts the request with probability

$$r_v = \min\left(\frac{\tau}{a_v}, 1\right), \tag{11}$$

where $\tau$ is a predefined threshold. Thus sensors with small Voronoi regions always accept, and sensors with large Voronoi
regions sometimes reject.

Given $\tau$, the probability $q_v$ that sensor $v$ is sampled can be written as:

$$q_v = \frac{\min(\tau, a_v)}{\sum_{t=1}^{n} \min(\tau, a_t)} = \frac{\min(\tau, a_v)}{|\{t : a_t > \tau\}|\,\tau + \sum_{t : a_t \le \tau} a_t}. \tag{12}$$

Here the denominator in expression (12) is the total probability that a query is accepted:

$$P_a = \sum_{v=1}^{n} a_v \min\left(\frac{\tau}{a_v}, 1\right) = |\{v : a_v > \tau\}|\,\tau + \sum_{v : a_v \le \tau} a_v. \tag{13}$$
Fig. 3. Graphical illustration of the rejection sampling procedure. The total shaded area is the probability of a query being rejected. The new sampling
distribution is given by the white histogram, appropriately renormalized.

Fig. 4. Inscribing circles in Voronoi cells. Construction used in the proof of Lemma 3.



Let Q denote the total number of requests made by a sensor before one is accepted. 
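The quantities in equations (11) through (13) are straightforward to compute once the Voronoi areas are known. The sketch below assumes the areas are supplied as an array summing to one; the function name `rejection_sampling_stats` and the toy areas in the usage note are illustrative assumptions, not values from the paper.

```python
import numpy as np

def rejection_sampling_stats(areas, tau):
    """Compute acceptance probabilities, sampling distribution, and P_a.

    areas : Voronoi cell areas a_v (assumed to sum to 1)
    tau   : rejection threshold
    """
    areas = np.asarray(areas, dtype=float)
    accept = np.minimum(tau / areas, 1.0)      # r_v, equation (11)
    p_accept = float(np.sum(areas * accept))   # P_a, equation (13)
    q = np.minimum(tau, areas) / p_accept      # q_v, equation (12)
    return accept, q, p_accept
```

With toy areas `[0.1, 0.2, 0.3, 0.4]` and `tau = 0.25`, the two large cells are capped at the threshold, so $P_a = 2 \cdot 0.25 + 0.1 + 0.2 = 0.8$ and the tempered distribution is `[0.125, 0.25, 0.3125, 0.3125]`.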

Figure 3 provides a graphical illustration of rejection sampling on the histogram of Voronoi cell sizes. Rejection sampling
"slices" the histogram at $\tau$, and renormalizes the distribution accordingly. The total area that is sliced off is equal to $1 - P_a$,
the probability that a query is rejected. Thus, we see that if $\tau$ is chosen to be too small, then the probability of rejection
becomes very large. Lemma 3 addresses this concern; in particular, it establishes that the choice $\tau = \Theta(n^{-1})$ suffices to
keep the rejection probability suitably bounded away from 1, so that the expected number of queries $\mathbb{E}[Q]$ remains finite. More
specifically, we choose $\tau$ such that

$$\mathbb{P}(a_v < \tau) = \min\left(\nu, \frac{\mu}{1+\mu}\right), \tag{14}$$

where the constants $\nu$ and $\mu$ control the undersampling and oversampling respectively. With this choice of $\tau$, the results of
Bash et al. [23] ensure that no sensor is sampled with probability greater than $(1 + \mu)/n$ and no more than $\nu n$ sensors are
sampled with probability less than $1/n$. The following result establishes that the acceptance probability remains sufficiently
large:

Lemma 3: Let $0 < c < 1/4$. For $\tau = c n^{-1}$, we have $\mathbb{P}(a_v > \tau) \ge 1 - 4c$.

Proof: We use a simple geometric argument to lower bound $\mathbb{P}(a_v > \tau)$. Consider a node $s$ such that a circle of area $\tau$
centered at it lies entirely within its Voronoi region, as shown in Figure 4. Clearly, such nodes are a subset of those with area larger than
$\tau$. The radius of this circle is $r = \sqrt{\tau/\pi}$. The circle lies within the Voronoi region whenever $r$ is no more than half the distance to the nearest node. Thus, in order to
inscribe a circle of radius $r$ in the Voronoi region, it suffices that all other nodes lie outside a circle of radius $2r$ around the node. This
larger circle has area $4\tau$, so

$$\mathbb{P}(a_v > \tau) \ge (1 - 4\tau)^{n-1} = (1 - 4c n^{-1})^{n-1} \ge 1 - 4c. \tag{15}$$

Thus, by appropriate choice of $c$, we can make the acceptance probability arbitrarily close to 1. ■
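The final inequality in (15) is stated without justification. One way to see it, offered here as a supporting step rather than as the authors' own argument, is Bernoulli's inequality $(1 - x)^m \ge 1 - mx$ for $0 \le x \le 1$ and integer $m \ge 1$, applied with $x = 4c/n$ and $m = n - 1$:

```latex
\left(1 - \frac{4c}{n}\right)^{n-1}
  \;\ge\; 1 - (n-1)\,\frac{4c}{n}
  \;=\; 1 - 4c + \frac{4c}{n}
  \;\ge\; 1 - 4c .
```

The condition $0 < c < 1/4$ guarantees $4c/n \le 1$, so the inequality applies for every $n \ge 1$.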
Our next step is to bound the distance between the new sampling distribution $q$ (i.e., after tempering by the rejection sampling
procedure), and the uniform distribution $n^{-1}\vec{1}$ over acceptance regions. These bounds are used in the next section to bound an
eigenvalue of a matrix associated with the gossip algorithm.

Lemma 4: For any $\epsilon > 0$, there exist constants $\mu > 0$ and $\nu > 0$ such that rejection sampling with parameters $(\mu, \nu)$
ensures that

$$\left\| q - n^{-1}\vec{1} \right\|_1 < \epsilon \tag{16a}$$

and

$$\left\| q - n^{-1}\vec{1} \right\|_2 < \frac{1}{\sqrt{n}}\,\epsilon. \tag{16b}$$



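Proof: Given $\epsilon > 0$, choose $\nu$ and $\mu$ such that $\nu + \mu < \epsilon$ and $\nu + \mu^2 < \epsilon^2$. We then expand and bound the error function
as

$$\sum_{v=1}^{n} \left| q_v - \frac{1}{n} \right| = \sum_{v : q_v < 1/n} \left| q_v - \frac{1}{n} \right| + \sum_{v : q_v \ge 1/n} \left| q_v - \frac{1}{n} \right|.$$

Now we use the properties of rejection sampling from [23]:

$$q_v \le \frac{1+\mu}{n} \quad \text{for all } v, \tag{17}$$

$$\left| \{ v : q_v < 1/n \} \right| \le \nu n. \tag{18}$$

On the set $\{v : q_v \ge 1/n\}$ we use the first bound and on the set $\{v : q_v < 1/n\}$ we use the second bound:

$$\sum_{v=1}^{n} \left| q_v - \frac{1}{n} \right| \le \nu n \cdot \frac{1}{n} + n \left( \frac{1+\mu}{n} - \frac{1}{n} \right) = \nu + \mu,$$

which is less than $\epsilon$ by our choice of $\nu$ and $\mu$.
Turning now to the bound (16b), we write

$$\left\| q - n^{-1}\vec{1} \right\|_2^2 = \sum_{v : a_v < \tau} \left( q_v - \frac{1}{n} \right)^2 + \sum_{v : a_v \ge \tau} \left( q_v - \frac{1}{n} \right)^2 \le \nu n \cdot \frac{1}{n^2} + n \cdot \frac{\mu^2}{n^2} = \frac{1}{n}\left(\nu + \mu^2\right) < \frac{1}{n}\,\epsilon^2.$$

Taking square roots yields (16b). ■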

Finally, we need to bound the expected number of rejections and the maximum number of rejections in order to bound the
expected number of transmissions and total transmission time. Recall that $Q$ is the number of queries that a sensor has to
make before one is accepted, and has a geometric distribution:

$$\mathbb{P}(Q = t) = P_a (1 - P_a)^t.$$

Lemma 5: For a fixed $(\mu, \nu)$, rejection sampling leads to a constant number of expected rejections.

Proof: The random variable $Q$ is just a geometric random variable with parameter $P_a$, so we can bound its mean as:

$$\mathbb{E}[Q] \le \frac{1}{P_a} = \frac{1}{|\{v : a_v > \tau\}|\,\tau + \sum_{v : a_v \le \tau} a_v} \le \frac{1}{(1-\nu)\,\tau n} = O(1), \tag{19}$$

where the final step follows since $\tau = \Theta(n^{-1})$ by construction. ■
Lemma 6: Let $\{Q_k : k = 1, 2, \ldots, K\}$ be a set of i.i.d. geometric random variables with parameter $P_a$. For any fixed pair
$(\mu, \nu)$, rejection sampling gives

$$\max_{1 \le k \le K} Q_k = O(\log K + \log \epsilon^{-1}) \tag{20}$$

with probability greater than $1 - \epsilon/2$.
Proof: For any integer $m \ge 2$, a straightforward computation yields that

$$\mathbb{P}(Q \le m - 1) = P_a \sum_{t=0}^{m-1} (1 - P_a)^t = 1 - (1 - P_a)^m.$$

By the i.i.d. assumption, we have

$$\mathbb{P}\left(\max_k Q_k \le m - 1\right) = \left[1 - (1 - P_a)^m\right]^K = \left[1 - \exp\left(m \log(1 - P_a)\right)\right]^K.$$

We want to choose $m = m(K, \epsilon)$ such that this probability is greater than or equal to $1 - \epsilon/2$. First set $m = -\rho \frac{\log K}{\log(1 - P_a)}$,
where $\rho$ is to be determined. Then we have

$$\mathbb{P}\left(\max_k Q_k \le m - 1\right) = \left[1 - 1/K^{\rho}\right]^K.$$

We now need to choose $\rho > 1$ such that

$$\left[1 - 1/K^{\rho}\right]^K \ge 1 - \epsilon/2,$$

or equivalently, such that

$$1 - \left[1 - 1/K^{\rho}\right]^K \le \epsilon/2.$$

Without loss of generality, let $K$ be even. Then by convexity, we have $(1 - y)^K \ge 1 - Ky$. Applying this with $y = 1/K^{\rho}$,
we obtain

$$1 - \left[1 - 1/K^{\rho}\right]^K \le 1/K^{\rho - 1}.$$

Hence we need to choose $\rho > \log(2/\epsilon)/\log K + 1$ for the bound to hold. Thus, if we set

$$m = \frac{\log(2/\epsilon) + \log K}{-\log(1 - P_a)} = O(\log K + \log \epsilon^{-1}),$$

then with probability greater than $1 - \epsilon/2$, all $K$ rounds of the protocol use fewer than $m$ rounds of rejection. ■
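The choice of $m$ in the proof of Lemma 6 can be verified numerically. The following sketch assumes $K > 1$ and $0 < P_a < 1$; the function names are hypothetical and the rounding up to an integer is a convenience not present in the proof.

```python
import math

def rejection_round_bound(K, eps, p_accept):
    """Smallest integer m with (1 - p_accept)**m <= eps / (2 K).

    This corresponds to rho = log(2/eps)/log(K) + 1 in the proof of Lemma 6.
    """
    if not (K > 1 and 0.0 < eps < 1.0 and 0.0 < p_accept < 1.0):
        raise ValueError("need K > 1, 0 < eps < 1, 0 < p_accept < 1")
    numer = math.log(2.0 / eps) + math.log(K)
    return math.ceil(numer / -math.log(1.0 - p_accept))

def prob_all_below(K, m, p_accept):
    """P(max_k Q_k <= m - 1) for K i.i.d. geometric variables."""
    return (1.0 - (1.0 - p_accept) ** m) ** K
```

For instance, with $K = 1000$ rounds, $\epsilon = 0.01$, and $P_a = 0.5$, the bound gives $m = 18$, and the probability that no round needs $m$ or more rejections exceeds $1 - \epsilon/2 = 0.995$.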

C. Averaging with gossip

As with averaging algorithms based on pairwise updates [16], the convergence rate of our method is controlled by the second
largest eigenvalue, denoted $\lambda_2(W)$, of the matrix

$$W := I + \frac{1}{2n}\left[P + P^T - D\right],$$

where $D$ is diagonal with entries $D_i = \sum_{j=1}^{n} [P_{ij} + P_{ji}]$. The $(i, j)$-th entry of the matrix $P$ is the probability that node $i$
exchanges values with node $j$. Without rejection sampling, $P_{ij} = a_j$, and with rejection sampling, $P_{ij} = q_j$. With this notation,
we are now equipped to state and prove the main result of the paper.

Theorem 1: The geographic gossip protocol with rejection threshold $\tau = \Theta(n^{-1})$ has an averaging time

$$T_{\mathrm{ave}}(n, \epsilon) = \Theta(n \log \epsilon^{-1}). \tag{21}$$

Proof: To establish this bound, we exploit Theorem 3 of [16], which states that the $\epsilon$-averaging time is given by

$$T_{\mathrm{ave}}(n, \epsilon) = \Theta\left(\frac{\log \epsilon^{-1}}{\log \lambda_2(W)^{-1}}\right). \tag{22}$$

Thus, it suffices to prove that $\log \lambda_2(W)^{-1} = \Omega(1/n)$ in order to establish the claim.

The probability of any sensor choosing sensor $v$ is just $q_v$, so that we can write $P$ as the outer product $P = \vec{1} q^T$. Note that
the diagonal matrix $D$ has entries

$$D_i = \sum_{j=1}^{n} (P_{ij} + P_{ji}) = \sum_{j=1}^{n} q_j + \sum_{j=1}^{n} q_i = 1 + n q_i.$$

Overall, we can write $W$ in terms of outer products as:

$$W = \left(I - \frac{1}{2n}\,\mathrm{diag}(\vec{1} + nq)\right) + \frac{1}{2n}\left(\vec{1} q^T + q \vec{1}^T\right). \tag{23}$$

Note that the matrix $W$ is symmetric and positive semidefinite.
We claim that the second largest eigenvalue satisfies $\lambda_2(W) \le 1 - c/n$ for some constant $c > 0$. By a Taylor series expansion,
this implies that $\log \lambda_2(W)^{-1} = \Omega(n^{-1})$ as desired. To simplify matters, we transform the problem to finding the maximum
eigenvalue of an alternative matrix. Since $W$ is doubly stochastic, Perron-Frobenius theory [29] guarantees that its largest
eigenvalue is one, and has associated eigenvector $v_1 = n^{-1/2}\vec{1}$. Consider the matrix $W' = W - \frac{1}{n^2}\vec{1}\vec{1}^T$; using equation (23),
it can be decomposed as

$$W' = D' + Q',$$

where $D' = \left(I - (2n)^{-1}\mathrm{diag}(\vec{1} + nq)\right)$ is diagonal and

$$Q' = \frac{1}{2n}\left(\vec{1}\,(q - n^{-1}\vec{1})^T + (q - n^{-1}\vec{1})\,\vec{1}^T\right)$$

is symmetric.

Note that by construction, the eigenvalues of $W'$ are simply

$$\lambda(W') = \left\{1 - \frac{1}{n},\ \lambda_2(W),\ \ldots,\ \lambda_n(W)\right\}.$$

On one hand, suppose that $\lambda_1(W') > \lambda_2(W)$; in this case, $\left(1 - \frac{1}{n}\right) > \lambda_2(W)$ and we are done. Otherwise, we have

$$\lambda_1(W') = \lambda_2(W).$$

Note that $W'$ is the sum of two Hermitian matrices: a diagonal matrix and a symmetric matrix with small entries. We can
therefore apply Weyl's theorem [29, p. 181], to obtain that

$$\lambda_1(W') \le \lambda_1(D') + \lambda_1(Q') \le \left(1 - \frac{1}{2n}\right) + \lambda_1(Q').$$

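It is therefore sufficient to bound $\lambda_1(Q')$. We do so using the Rayleigh-Ritz theorem [29, p. 176], the Cauchy-Schwarz
inequality, and Lemma 4 as follows:

$$\begin{aligned}
\lambda_1(Q') &= \max_{y : \|y\|_2 = 1} y^T Q' y \\
&= \frac{1}{2n} \max_{y : \|y\|_2 = 1} y^T \left( \vec{1}\,(q - n^{-1}\vec{1})^T + (q - n^{-1}\vec{1})\,\vec{1}^T \right) y \\
&= \frac{1}{n} \max_{y : \|y\|_2 = 1} y^T \vec{1}\,(q - n^{-1}\vec{1})^T y \\
&\le \frac{1}{n} \max_{y : \|y\|_2 = 1} \|y\|_2 \cdot \|\vec{1}\|_2 \cdot \|q - n^{-1}\vec{1}\|_2 \cdot \|y\|_2 \\
&< \frac{1}{n} \left( 1 \cdot \sqrt{n} \cdot \frac{1}{\sqrt{n}}\,\epsilon \right) \\
&= \frac{1}{n}\,\epsilon.
\end{aligned}$$

Overall we have proved the bound

$$\lambda_2(W) \le \left(1 - \frac{1}{2n}\right) + \frac{1}{n}\,\epsilon. \tag{24}$$

We can choose $\epsilon < 1/4$ using Lemma 4 to get the desired bound. ■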
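The eigenvalue bound can be checked numerically for small networks. The sketch below builds $W$ from a sampling distribution $q$ as in equation (23) and compares $\lambda_2(W)$ with the intermediate bound $1 - \frac{1}{2n} + \|q - n^{-1}\vec{1}\|_2/\sqrt{n}$ obtained from Weyl's theorem and the Cauchy-Schwarz step (before Lemma 4 is applied). The function names and the random test distribution are assumptions for illustration.

```python
import numpy as np

def gossip_matrix(q):
    """W = I + (P + P^T - D) / (2n) with P = 1 q^T and D = diag(1 + n q)."""
    q = np.asarray(q, dtype=float)
    n = q.size
    P = np.outer(np.ones(n), q)          # P_ij = q_j
    D = np.diag(1.0 + n * q)             # D_i = sum_j (P_ij + P_ji)
    return np.eye(n) + (P + P.T - D) / (2.0 * n)

def second_eigenvalue(W):
    """Second largest eigenvalue of a symmetric matrix."""
    return float(np.linalg.eigvalsh(W)[-2])   # eigvalsh sorts ascending

def eigen_bound(q):
    """Upper bound 1 - 1/(2n) + ||q - 1/n||_2 / sqrt(n) on lambda_2(W)."""
    q = np.asarray(q, dtype=float)
    n = q.size
    return 1.0 - 1.0 / (2.0 * n) + np.linalg.norm(q - 1.0 / n) / np.sqrt(n)
```

In the perfectly uniform case $q = n^{-1}\vec{1}$, the matrix reduces to $W = (1 - \frac{1}{n}) I + \frac{1}{n^2}\vec{1}\vec{1}^T$, so $\lambda_2(W) = 1 - 1/n$ exactly, which sits below the bound $1 - 1/(2n)$.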
The preceding theorem shows that by using rejection sampling we can bound the convergence time of the gossip algorithm. 

We can therefore bound the number of radio transmissions required to estimate the average. 

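Corollary 1: The expected number of radio transmissions required for our gossip protocol on the geometric random graph
$G\left(n, \sqrt{\log n / n}\right)$ is upper bounded by

$$\mathcal{E}(n, \epsilon) = O\left(\frac{n^{3/2}}{\sqrt{\log n}} \log \epsilon^{-1}\right). \tag{25}$$

Moreover, with probability greater than $1 - \epsilon/2$, the maximum number of radio transmissions is upper bounded by

$$\mathcal{C}(n, \epsilon) = O\left(\mathcal{E}(n, \epsilon)\left[\log n + \log \epsilon^{-1}\right]\right). \tag{26}$$

Remark: Note that for $\epsilon = n^{-\alpha}$ for any $\alpha > 0$, our bounds are of the form $\mathcal{E}(n, 1/n^{\alpha}) = O\left(n^{3/2}\sqrt{\log n}\right)$ and $\mathcal{C}(n, \epsilon) = O\left(n^{3/2}\log^{3/2} n\right)$.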
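The forms quoted in the remark follow by direct substitution into (25) and (26). As a brief worked check, treating $\alpha$ as a fixed constant, set $\epsilon = n^{-\alpha}$ so that $\log \epsilon^{-1} = \alpha \log n$:

```latex
\mathcal{E}(n, n^{-\alpha})
  = O\!\left(\frac{n^{3/2}}{\sqrt{\log n}} \cdot \alpha \log n\right)
  = O\!\left(n^{3/2} \sqrt{\log n}\right),
\qquad
\mathcal{C}(n, n^{-\alpha})
  = O\!\left(n^{3/2}\sqrt{\log n} \cdot (1 + \alpha)\log n\right)
  = O\!\left(n^{3/2} \log^{3/2} n\right).
```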
Fig. 5. Estimation accuracy versus total spent energy for a linearly varying field.



Proof: We just have to put the pieces together. If we assume an asynchronous protocol, the total cost
is given by the product of $O\left(\sqrt{n / \log n}\right)$ from routing, $\mathbb{E}[Q]$ from rejection sampling, and the averaging time $T_{\mathrm{ave}}$. From
Lemma 5, $\mathbb{E}[Q] = O(1)$. Using equation (22) and Theorem 1, we can bound $\log \lambda_2(W)^{-1}$ from below by $(1 - \lambda_2(W)) = \Theta(n^{-1})$. Thus,
the expected number of communications is

$$O\left(\sqrt{\frac{n}{\log n}}\; \mathbb{E}[Q]\; n \log \epsilon^{-1}\right) = O\left(\frac{n^{3/2}}{\sqrt{\log n}} \log \epsilon^{-1}\right). \tag{27}$$

To upper bound the maximum number of transmissions with high probability, we note that Lemma 6 guarantees that

$$\max_{k = 1, \ldots, T_{\mathrm{ave}}} Q_k = O(\log T_{\mathrm{ave}} + \log \epsilon^{-1}) \tag{28}$$

with high probability. Using Theorem 1, we can see that $O(\log T_{\mathrm{ave}} + \log \epsilon^{-1}) = O(\log n + \log \epsilon^{-1})$. Consequently, with
probability greater than $1 - \epsilon/2$,

$$\mathcal{C}(n, \epsilon) = O\left(\mathcal{E}(n, \epsilon)\left[\log n + \log \epsilon^{-1}\right]\right). \tag{29}$$

■


V. Simulations 

Note that the averaging time defined in equation (1) is a conservative measure, obtained by selecting the worst case initial
field x(0) for each algorithm. Due to this conservative choice, an algorithm is guaranteed to give (with high probability) an 
estimated average that is e close to the true average for all choices of the underlying sensor observations. As we have theoretically 
demonstrated, our algorithm is provably superior to standard gossiping schemes in terms of this metric. In this section, we 
evaluate our geographic gossip algorithm experimentally on specific fields that are of practical interest. We construct three 
different fields and compare geographic gossip to the standard gossip algorithm with uniform neighbor selection probability. 
Note that for random geometric graphs, standard gossiping with uniform neighbor selection has the same scaling behavior as 
with optimal neighbor selection probabilities [16], which ensures that the comparison is fair. 

Figures 5 through 7 illustrate how the cost of each algorithm behaves for various fields and network sizes. The error in the
average estimation is measured by the normalized $\ell_2$ norm of the deviation from the true average. On the other axis we plot the total number of radio
transmissions required to achieve the given accuracy. Figure 5 demonstrates how the estimation error behaves for a field that
varies linearly. In Figure 6 we use a field that is created by placing temperature sources in the unit square and smoothing the
field by a simple process that models temperature diffusion. Finally, in Figure 7 we use a field that is zero everywhere except
in a sharp spike in the center of the field. For this case, geographic gossip significantly outperforms standard gossip as the
network size and time increase, except for large estimation tolerances ($\epsilon \approx 10^{-1}$) and small number of rounds.

As would be expected, simple gossip is capable of computing local averages quite fast. Therefore, when the field is 
sufficiently smooth, or when the averages in local node neighborhoods are close to the global average, simple gossip can 
generate approximate estimates that are closer to the true average with a smaller number of transmissions. For these cases, 
however, it is arguable that finding the global average is not of substantial interest in the first place. In all our simulations, 
the energy gains obtained by using geographic gossip were significant and asymptotically increasing for larger network sizes, 
corroborating our theoretical results. 
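To make the averaging dynamics concrete, the following is a minimal simulation sketch for the spike field. It is deliberately idealized and is not the experimental setup used for the figures: the partner is drawn uniformly over sensors (that is, $q_v = 1/n$ exactly, as if rejection sampling were perfect), routing cost is ignored, and the function name is hypothetical.

```python
import numpy as np

def simulate_idealized_geographic_gossip(x0, rounds, seed=0):
    """Pairwise averaging with uniformly chosen partners.

    Returns the final values and the normalized l2 error after each round,
    measured as ||x(t) - x_ave 1|| / ||x(0)||.
    """
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)
    n = x.size
    avg = x.mean()
    norm0 = np.linalg.norm(x)
    errors = np.empty(rounds)
    for t in range(rounds):
        s = rng.integers(n)   # node whose clock ticks
        v = rng.integers(n)   # partner, uniform over sensors (idealized)
        x[s] = x[v] = 0.5 * (x[s] + x[v])   # both nodes adopt the pair average
        errors[t] = np.linalg.norm(x - avg) / norm0
    return x, errors
```

Each exchange preserves the sum of the values and never increases the distance to the average. In this idealized setting the expected squared error contracts by a factor $(1 - 1/n)$ per round, consistent with $\lambda_2(W) = 1 - 1/n$ for uniform $q$.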
Fig. 6. Estimation accuracy versus total spent energy for a smooth field modeling temperature.




Fig. 7. Estimation accuracy versus total spent energy for a field which is zero everywhere except in a sharp spike. 



VI. Conclusions 

In this paper, we proposed and analyzed a novel message-passing algorithm for computing averages in networks in a 
distributed manner. By exploiting geographic knowledge of the network, our geographic gossip algorithm computes the 
averages faster than standard nearest-neighbor gossip. Even if the specific type of geographic routing considered here cannot 
be performed, similar gossip algorithms could be developed for any network structure that supports some form of routing to 
random nodes. Thus, our nearest-neighbor gossip can be understood as a particular case of a more general family of algorithms 
in which message-passing occurs on the overlay network supported by random routing. Other routing protocols may produce 
different overlay networks that could be analyzed in a similar manner. 

In this paper, we analyzed in detail the case of certain regular graphs, including the ring and grid networks, as well as the 
random geometric graph model, which is commonly used as a model of sensor networks under random deployments. Our 
algorithm can also be applied to other topologies that realistically model wireless sensor networks, and should provide gains 
when (a) the mixing time of a random walk on the graph is slow, (b) efficient routing is possible, and (c) uniform sampling
over space can yield approximately uniform sampling over sensors. 

Although the current work has focused on the averaging problem, it is worth noting that many more complicated functions of 
interest can be computed using gossip; see the papers [2], [14], [30], [31] for various examples involving localization, Kalman 
filtering and sensor fusion. However, linear operations (such as filtering) can be computed using our algorithm by allowing 
the sensors to pre-scale their observations by their coefficients in the objective function. Our results suggest that geographic 
gossip may be useful instead of standard nearest-neighbor gossip to improve energy consumption in these and other distributed 
signal processing applications. 